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Abstract — Sensor networks potentially feature large numbers 
of nodes. The nodes can monitor and sense their environment 
over time, communicate with each other over a wireless network, 
and process information that they exchange with each other. They 
differ from data networks in that the network as a whole may 
be designed for a specific application. 

We study the theoretical foundations of such large scale sensor 
networks. We address four fundamental organizational and 
operational issues related to large sensor networks - connectivity, 
capacity, clocks and function computation. 

To begin with, a sensor network must be connected so that 
information can indeed be exchanged between nodes. The connec- 
tivity graph of an ad-hoc network is modeled as a random graph 
and the critical range for asymptotic connectivity is determined, 
as well as the critical number of neighbors that a node needs 
to connect to. Next, given connectivity, we address the issue of 
how much data can be transported over the sensor network. We 
present fundamental bounds on capacity under several models, as 
well as architectural implications for how wireless communication 
should be organized. 

Temporal information is important both for the applications of 
sensor networks as well as their operation. We present fundamen- 
tal bounds on the synchronizability of clocks in networks, and 
also present and analyze algorithms for clock synchronization. 
Finally we turn to the issue of gathering relevant information, 
that sensor networks are designed to do. One needs to study 
optimal strategies for in-network aggregation of data, in order 
to reliably compute a composite function of sensor measurements, 
as well as the complexity of doing so. We address the issue of 
how such computation can be performed efficiently in a sensor 
network and the algorithms for doing so, for some classes of 
functions. 

Key Words: Sensor Networks, Random Networks, Large Scale 
Networks, Connectivity, Capacity, Clock Synchronization, In-network 
Information Processing, Function Computation, Communication 
Complexity, Zero-error Information Theory. 

I. Introduction 

Wireless sensor networks are composed of nodes with sensing, 
wireless communication, and computation capabilities. They can 
potentially deploy large numbers of sensors. The sensors themselves 
can measure the environment over time and generate data, from 
which we seek to extract relevant information. The sensor nodes 
communicate with their neighbors wirelessly, and cooperate with each 
other in processing the data. 

Sensor networks therefore feature a combination of many func- 
tionalities - sensing, communication and computation, which are 
potentially exercised over large numbers of nodes. In this paper, we 
study the fundamental properties of such large scale sensor networks. 

Since sensor networks assess the environment based on interacting 
with each other, a fundamental global property of the network 
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that is of interest is connectivity, which ensures that all nodes can 
communicate with each other over multiple hops. We study how it can 
be achieved through local properties of nodes. Modeling the locations 
of sensor nodes as randomly distributed over a domain, we study how 
a connected sensor network can result from choices of communication 
ranges or by choices of neighborhood sizes, made locally by nodes. 

Nodes in large scale sensor networks communicate with each other 
wirelessly. It is of interest to understand how the information that 
can be communicated scales with the number of nodes. We present 
different models for doing so, at different modeling granularities. We 
also address the issue of the architecture of the wireless network that 
can facilitate data transfer. 

In sensor networks, the goal is often not to download all the 
measurements of all the nodes, but only to obtain certain concise 
functions of the data. For this purpose, one can exploit the compu- 
tational capabilities of the nodes to process information as it flows 
through them so as to efficiently deliver only what is sought by the 
application for which the sensor network has been designed. It is 
therefore of interest to determine how to efficiently conduct such in- 
network information processing. We formulate and analyze this issue 
as one of function computation over a wireless network and present 
results on how this can be done efficiently, and what rates of data 
aggregation are possible for a large network. 

The phenomenon being monitored by a sensor network is often 
time-varying, and so it is important to time-stamp events accurately. 
In some problems, such as localization, the accuracy of time-stamps 
is critical to the accuracy of the inference. This requires clock syn- 
chronization over the network, which is also important to operating 
the network efficiently, for example in synchronizing wake and sleep 
cycles or in scheduling other events. More generally, sensor networks 
are cyberphysical systems, where the notion of time is important 
for the physical system. We address fundamental issues related to 
what can and cannot be estimated vis-a-vis time and delays. We also 
present algorithms for clock synchronization and delay estimation, 
and study their properties for large networks. 

A more complete understanding of fundamental issues that arise 
in large scale sensor networks can potentially provide a strong 
theoretical foundation for sensor networks, that can inform network 
designers about optimal design and operation of this new technology. 

In Section [II] we study the key property of connectedness. We 
study the connectivity of a randomly deployed ad-hoc network 
using geometric random graph models and find the critical range 
beyond which the graph is connected with high probability. We also 
characterize connectivity based on the number of neighbors of each 
node, which permits the use of local logical variables to ensure the 
global property of connectivity. 

In Section [III] we present sharp order bounds on the amount of 
information that a wireless network can transfer under two distinct 
interference models, namely the protocol and physical models. We 
characterize the transport capacity measured in bit-meters/sec, where 
we say that the network transfers one bit-meter/sec, if one bit 
of information is transferred one meter closer to its destination 
within one second. In the case of a random network, we define 
the throughput capacity to measure the achievable throughput per 
node (in bits/sec). We also show how network information theory 
can be used to study the capacity of planar networks, and establish 
fundamental connections between the physical propagation properties 
of the medium and the resulting capacity of the network. We show 
the critical role played by signal attenuation parameters, absorption 
constant and path loss exponent, and provide insights into order 
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optimal architectures. 

The next issue we address, in Section [rv] is time. We present dif- 
ferent possible notions of synchronization in a network. An important 
role is played in clock synchronization by the fact that link delays 
are unknown in a wireless network. We study the synchronization of 
affine clocks, and present a fundamental impossibility result for the 
case of unknown asymmetric link delays, as well as characterize the 
uncertainty set in the parameter space. We present and analyze a fully 
decentralized scheme based on spatial smoothing of the estimated 
time differences between pairs of clocks, and analyze its convergence 
rate for different classes of graphs. 

In Section [V] we present the outline of a theory of in-network 
computation, where we study optimal strategies for in-network ag- 
gregation of data, in order to compute a function reliably. The goal is 
to enhance application performance by focusing on the joint operation 
of communication and computing in a sensor network. We consider 
function computation in tree graphs, and present a zero-error block 
computation approach that is optimal. In Section, |V-B| we classify 
symmetric functions into type-threshold functions, exemplified by 
Max or Min, and type-sensitive functions, exemplified by Average. 
We present order-optimal strategies for computing both classes of 
functions in multihop networks. In Section |V-C| we provide exact 
results for computing Boolean functions in collocated networks, 
using tools from information theory and communication complexity. 
The most general information theoretic formulation of this problem 
presents formidable problems which have remained unsolved even 
for simple networks. We review existing results from information 
theory, and present some interesting results regarding computation 
over noisy channels. 

II. Connectivity of wireless networks 

From the viewpoint of communications, sensor networks are 
wireless ad-hoc networks formed by nodes which communicate with 
neighboring nodes over a wireless channel. Nodes in the network 
cooperate in sending data concerning each other. An important 
desirable global property of such a wireless ad-hoc network is that it 
is connected, i.e., it constitutes a connected graph. This is particularly 
important in sensor networks, where achieving a common application 
objective may require communication among all the nodes. An 
important example is when some information concerning all the 
sensor nodes needs to be collected by a designated fusion node. 

One way to obtain a point-to-point communication link from a 
node i to a node j, is to ensure that the ratio of the received signal 
from i to the noise at node j is greater than a certain threshold. Given 
the power at which node i transmits and the attenuation properties of 
the medium as well as an upper bound on interference and noise, one 
can calculate if node j can successfully receive node i's transmission. 
This gives rise to an undirected connectivity graph with the set of 
nodes as vertices, where an edge (i, j) is present if node j is reachable 
by node i. A node can increase its range and its set of neighbors 
by increasing its transmit-power level. For uniform operation and 
analysis of a large number of nodes, an abstraction is to suppose that 
all nodes pick a common transmit range, and consider the problem of 
determining the critical range at which each node needs to transmit 
so as to guarantee asymptotic connectivity of the network. 

Since ad-hoc networks have arbitrary topology, the corresponding 
connectivity graph is modeled as a random graph. There are two 
important random graph models. 

Erdos-Renyi Graphs - B(n,p(n)) is a probabilistic graph consisting 
of n nodes, in which edges are chosen independently and with 
probability p(n). The critical probability for asymptotic connectivity 
of Erdos-Renyi graphs, see [ 1 1, is given by the following: 

Theorem 1. If p(n) = log "+ c( " ) , then the probability that 
B(n,p(n)) is connected converges to one as n — > oo if and only 
if c(n) — > +oo. 

In an Erdos-Renyi random graph, the event that there is an edge 
between i and k is independent of the event that there are edges 



between i and j, and j and fc. However, this is not generally true in 
an ad-hoc network, since it does not capture the loss of connectivity 
in a network when nodes are far from each other. Therefore one turns 
to graphs where the presence of a link between two nodes is based 
on the distance between the nodes. Such a modeling approach can 
be traced back to Gilbert [4| who may be regarded as a pioneer of 
continuum percolation theory l5l . 

Random Geometric Graphs - Let V be a disk in 1Z 2 having 
unit area. Let G(n, r(n)) be the graph formed when n nodes are 
placed uniformly and independently in T>, and two nodes i and j are 
connected by an edge if the distance between them is less than r(n). 
Then the problem is to determine the range r(n) of transmissions 
which guarantees the probability that Q(n,r{n)) is connected goes 
to one as n — * oo. This problem is studied in |2| and (3): 

Theorem 2. // nr 2 (n) = logn + c(n) , then the probability that 
Q(n,r{n)) is connected converges to one as n —* oo if and only 
if c(n) — > +oo. 

A thorough treatment of random geometric graphs can be found 
in (3). 

Connectivity is a global property of the network, and it is of 
interest to determine if there are other local properties, besides range, 
that can also be used to achieve connectivity in a wireless network. 
One such quantity is the number of neighbors of a node, which has 
the desirable feature that it is a logical property that can be checked 
locally by each node. The following result was established in l6l : 

Theorem 3. Let Q{n,(f)n) be the network formed when each node 
is connected to its 4>n nearest neighbors, i.e., there is an edge 
if either i or j is one of the <f>„ nearest neighbors of the other. Then 
there exist constants C2 > c\ > such that Q(n,(f> n ) is connected 
(resp. disconnected) with probability approaching one as n — > +oo 
if (f>„ > c-2 log n ( resp. (/> n < ci log n). 

The constants are refined in |7|, where it is shown that C2 can be 
chosen smaller than one, indicating that fewer neighbors are needed 
for connectivity based on number of neighbors than range. 

III. Capacity of Wireless Networks 

Unlike wireline networks, a wireless network is a shared medium 
where transmissions interfere with one another. Two or more nodes 
can make concurrent successful transmissions provided there is no 
destructive interference at the receivers. Two popular interference 
models are the following. In the protocol model every active trans- 
mission from a node k to a node I creates an interference zone which 
is a disk of radius (1 + A)pki centered at node k, where p^i is the 
distance between nodes k and I, and A > 0.. Transmission from a 
node i to some node j is successful if it is not in the interference zone 
of any other active transmitter. In the physical model, a transmission 
from i to j is successful if the received signal-to-interference plus 
noise ratio (SINR) at j is higher than a given threshold, i.e., if 

SINR(j) := — > 0, (1) 

N + 2^kET\{i} Pkj 

where Pi is the power level of the i— th transmission, T is the set of 
concurrently active transmitters, is the ambient noise power level, 
a > 2 is the path loss exponent, and /? is the threshold for successful 
reception. Each transmitter i needs to satisfy a power constraint Pi < 

Pind ■ 

Consider a planar wireless network of n nodes deployed in a disc 
of fixed area A, where each node can transmit at a throughput of W 
bits/sec. 

As a measure of performance we can consider the transport 
capacity (Ct) of the network measured in bit-meters/sec, where we 
say that a network transports one bit-meter of information, if one bit 
of information is transferred one meter Q An alternative manner of 

' If we assume that the transport capacity is equitably divided, the per-node 
transport capacity is obtained by dividing the transport capacity by n. 
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assessing the performance is by supposing that each node randomly 
picks a destination from among the other nodes, and then determining 
the throughput capacity, which is defined as the largest common 
throughput that can be provided to each origin-destination pair, i.e., 
the max-min throughput. 

In (8) the dependence of the transport capacity on n was studied 
both for the case of arbitrary networks, as well as the asymptotic 
behavior as n — > oo of the throughput capacity for random networks 
in which n nodes are independently and uniformly distributed in a 
two-dimensional disc of area A. A throughput capacity A(n) is said 
to be feasible in this latter case if there is a spatiotemporal scheme for 
scheduling transmissions such that each node can send an average of 
A(n) bits/sec to its destination in a multi-hop fashion, with probability 
approaching 1 as n — > oo. 

The main results under the two interference models are summa- 
rized below d), GO 



Theorem 4. 

1) (Protocol model) For a network with 
placement as well as optimal choice 
destination pairs, Ct = Q{Wy/ An) 



for each n, 
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V n log n 

is not, for appropriate < 
oo. 
is 



hA)VA\/2+A 

2) (Physical model) For all networks, Ct = 0(W 
optimally designed network Ct = Q(WV An). 

3) (Protocol model) For a random network, A(n) 
feasible while X(n) — ,' 2 , u 

J ^ ' yti log n 

Ci < C2 < oo, both with probability approaching 1 as n — 

4) (Physical model) For a random network, A(n) = n 

feasible while A(n) = ^?=- is not, for appropriate ci, C2 both 
with probability approaching 1 as n — > oo. 

The constructive proof of capacity [ 8 1 shows that, in the protocol 
model, an optimal scheme is to group the nodes into small cells 
and designate one specific node per cell to relay multi-hop packets 
traversing the cell. It is also nearly optimal even if all nodes use the 
same transmission range (which is chosen to be just high enough to 
guarantee network connectivity). 

In [9 1 the transport capacity is shown to be Q(y/n), even under 
a generalized physical model where each node uses adaptive coding 
to attain a bit rate of Slog(l + SINR(i)). In fl51 . the feasible 
throughput for a random network is shown to be <9(^=) for the 
generalized physical model. 

In order to assess what are the ultimate limits to how much 
information wireless networks can carry and how they should be 
operated, one needs to turn to network information theory. In 1 1 1 
the capacity of planar networks was formulated and studied under 



a signal path loss attenuation characteristic of the form e 



8 > is the path loss exponent. It should be noted that generally 
there is always some absorption in the medium, so 7 > 1 16 1. Thus, 
the received signal at node j at time t is 



yi(t) = Y,dje " ,Pi *x i (t) + z j (t), 



(2) 



where Zj(t) is White Gaussian Noise (WGN) with variance a 2 . 
Above, pij is assumed to be no less than some p m in > 0. The 
signal transmitted by node i at time t can depend on all causally 
acquired information, i.e., the messages it wants to send to other 
nodes, say Wik £ Wife, for k 7^ i, as well as its past receptions 
yl '-~ {yi( s ) '■ s < t}. This allows for general cooperation strategies 

2 We use the standard notation due to Knuth. We write /(n) = 0(g(n)) 
if limsup n ^ +00 j^jj < ci for some c x < 00, f(n) = Q(g(n)) if 

liminfn—,+00 r-fejj > C2i for some C2 < 00, and f(ra) = G(g(n)) if 
both f(n) = 0(g(n)) and f(n) = n(g(n)). 



among nodes, and accounts for the power of information theoretic 
results when they are obtainable. 

For a network of n nodes there are n(n — 1) possible 
source-destination pairs (i,j),j 7^ i. Suppose that a rate Rij 
is supported for source-destination pair (i,j). There are power 
constraints of either the form Pi < Pi„d, for each node i, 
or X^ILi ^ — Ptotai- The transport capacity is defined as 

s "l>.-/< achievable rate vector) E^i" 1 ' ^iPv- where acmev " 

ability of a rate vector is defined in an information-theoretic sense 

tm 

It was shown in 1101 that for all planar networks with 7 > or 

8 > 3, C T < cl(l,S ; P i mln) Ptotai, From this it follows that 

vairi) -Pind 

Ct < 5 n, (i) 

which is essentially a Q(V An)— law, since the area itself grows like 
n in this case, whence 0(V 'An) — &(n). 

Concerning the regime 7 = 0, it was established in 1 1 1 that for 
S < ^,Ct can be unbounded even for fixed Ptotai- Moreover, to 
illustrate possibilities, for | < 8 < 1, super-linear scaling of the 
transport capacity, in place of |3](, was shown to be possible. Both 
results are achieved by a strategy of coherent multi-stage relaying 
with interference subtraction, which effectively shows that nodes 
can cooperate over large distances using coherence and multi-user 
estimation when the attenuation is low. 

The regime where the scaling of the form (pi holds was further 
extended in 1 1 1 1, where it was established that Ct = O(n) for S > 2. 
For the case where 7 = and 8 £ [1, 2], by using novel forms of 
cooperation between nodes, it was established in 1141 that A(n) > 
en 1 - 6 ", for 8 £ [1, §] and A(n) < for 8 £ [|,2], where A(n) 
is the per-node throughput for a random destination. These results 
suggest the potential of other forms of cooperation besides multi- 
hop relaying in this attenuation regime. The precise dependence of 
constants on n is investigated in 1 12]. 

IV. Clock Synchronization 

Distributed clocks generally don't agree. Yet, clock synchroniza- 
tion requirements are required for several applications in sensor 
networks. Applications include coordinating events in a decentralized 
system 1321 . tracking, surveillance, target localization [34], data 
fusion, scheduled operations like power-efficient duty-cycling which 
is especially important for low power sensor nodes operation,, as well 
as in sensor-actuator methods where loops are closed over networks. 
Slotted communication protocols for random Medium Access Control 
(MAC) 1361 as well as environmental monitoring applications also 
require accurate clock synchronization. More broadly, as we head 
towards the era of event-cum-time driven systems featuring the 
convergence of computation and communication with control, the 
need for well-synchronized clocks becomes increasingly important, 
affecting QoS, system performance and safety. 

A. Notions of network clock synchronization 

There are two possible goals of clock synchronization |18|,|33|. 

1) Ordering of events. The goal is to create a right chronology of 
the events in the entire network, where knowledge of the exact 
time instants is not required, yet an ordering of events that 
may occur at different nodes has to be determined. This was 
thoroughly studied in |21| where the notion of virtual clocks 
was introduced. 

2) Synchronization. The goal of synchronization is to estimate 
the relative time differences among a set of clocks in the 
network. This information can be then used to translate time- 
stamps from one clock to any other clock. It has the advantage 
that the translation mechanism does not create undesirable 
dependencies by resetting clocks in hosts 1201 , and gives rise 
to the notion of relative clocks. By selecting a particular node 
as a reference, one can even set all clock displays to agreement 
and attain a global definition of time. 
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B. Fundamental limits for synchronization of affine clocks 

Let us first consider the simplest model of affine clocks 1171 . 
Denote the time display of a fixed reference clock by t, and assume 
that the display Tj (t) of a clock j at time t, satisfies 

Tj (t) —ajt + bj. (4) 

Here a,j > 0, is called the skew, and bj is the offset of clock j at the 
time of the reference clock. 

IV. B. 1. Model for packet delay: Delays in packet delivery 
constitute a fundamental limitation in synchronizing clocks over 
wireless sensor networks since they can be much larger than the 
required synchronization precision. Suppose that a packet sent by 
node i is received by node j after a delay of dij time units (measured 
in the time units of the reference clock, clock 1). The delays {dij} 
are assumed to be unknown but fixed. In fact delay estimation is an 
important problem in its own right, as well as an intrinsic part of the 
problem of clock synchronization 1231 . 1241. 

By the word "delay" here is meant the sum of all delays incurred 
by a packet after it is time-stamped by the transmitter and before 
the time-stamp is read by the receiver. This includes (cf. [ 30 1 , [28 1, 
El): 

1) Transmission delay. This accounts for the processing time in 
the transmitter after time-stamping. 

2) Propagation delay. This can be estimated accurately say by 
GPS, or other position information, since it only involves the 
distance between the nodes. 

3) Receiving delay. This accounts for the processing time in the 
receiver before time-stamping. 

With the exclusion of the electromagnetic propagation delay, the 
other delays can depend on the communication and computation 
platforms of the nodes involved and the load experienced at them. 
Due to this heterogeneity, delays are generally not symmetric, i.e., 
dij 7^ dji, or identical between links. 

IV. B. 2. Pairwise clock synchronization: Denote the time (as 
measured bv the i-th clock) that node i sends its fc-th packet by s' fc ' 
(see Figure lb, and denote by r^* the time (as measured by the j-th 
clock) that node j receives the fc-th packet sent by node i. 
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Fig. 1. Message exchanges between two nodes 

It was shown in 1201 . 1171 that it is impossible to even synchronize 
just two clocks. 

Theorem 5. Even under bilateral exchange of an infinite number 
of packets between the two nodes j and 1, estimation of the entire 
four-tuple (a,j,bj,dij,dji) is impossible. In particular, 

1) The skew can be estimated correctly, even if there are only two 
one-way pings, i.e., the link is unidirectional. 

2) The vector (bj,dij,dji) T of offset and delays can only be 
determined up to a translate of a one-dimensional subspace of 
R 3 . 

3) The round-trip delay (dij + dji) can be estimated precisely. 

4) If we further use knowledge of causality, that packets cannot 
be received before they are sent, i.e., that dij > 0, and also 
that a,j > 0, then the uncertainty set for the offset reduces 
to an interval, whose length is proportional to the round-trip 
delay. 



If we assume that delays are symmetric in the two directions, the 
relative offset between the two clocks at the time of the receipt of 
the k— th packet from j to i,Tij(k) (see Figure [TJ can be estimated 
by 

hj{k) = -rf + («f +a ij d ji ), (5) 
where dji is the delay estimate given by 

:= ^[(rf-^H^f -r^)0-&ji)]. (6) 

TV. B. 3. Network clock synchronization: Now consider the 
problem of synchronizing clocks over a network. Consider a network 
of n nodes represented by a graph G = (V, E) which contains 
a directed edge from (i,j) if i can send packets to j. Node 1 is 
considered to be the reference node. In the case of a link 
we call the ratio of the skews — as the relative skew, and the 

a i 

quantity bj f 6i as the relative offset between those two clocks. It 

is possible to determine the relative skew ^f by use of two packets 

r! fc + 1) -r< fc+1) 

sent from node i to node j, e.g., by — ''( fc) . The relative 

skew between two non-communicating nodes can be computed by 
multiplying the relative skews of the links across a directed path 
connecting those two nodes. In fact, every node in the network can 
estimate all nodal skews if and only if the graph is strongly connected 

03- 

The following theorem |T7:| establishes the result that without 
any further assumptions, clock synchronization is impossible in any 
network. 

Theorem 6. Consider a network of n nodes. It is impossible to 
determine all 2(n — 1) + \£\ unknown parameters {ai,bi, and dij 
for all i and all j 7^ i} even if all pairs of nodes can exchange 
any number of time-stamped packets containing any information that 
is causally known to the transmitter. Furthermore, if the graph is 
strongly connected then: 

1) All the skews {ai : 2 < i < n} can be estimated correctly. 

2) Every vector d = (dij ,(i,j) £ £ ) in the uncertainty set for the 
delay vector d = (dij , (i,j) £ £) can be expressed as a known 
affine transformation of (n — 1) variables {bi : 2 < i < n}. 
Each bi can be regarded as an estimate of the unknowns offset 
bi. Any choice of these estimates {bi : 2 < i < n} is consistent 
with all transmit and receipt time-stamps of all packets. 

3) If causality is invoked, i.e., delays are non-negative, the 
uncertainty set for the estimates of the offset parameters 
{bi : 2 < i < n} can be fully characterized as a compact 
polyhedron o/M n_1 . 

4) Suppose all links in £ are bilateral. Then the feasible polyhe- 
dron in [5] has a non-empty interior if and only if there is no 
bidirectional link with zero round-trip delay. 

C. Clock Synchronization algorithms 

We begin with the description of some well known clock synchro- 
nization protocols. In large networks where synchronization require- 
ments are not too stringent, e.g. Internet, the Network Time Protocol 
(NTP 1 26 1) has been used for over two decades. It is a hierarchical 
protocol with accuracy of the order of milliseconds |26|, obtained 
by synchronizing with external sources organized in a hierarchy 
of levels, called strata. Real-time applications in wireless sensor 
networks typically require precision in the order of microseconds. 
In several cases, e.g., indoors, densely populated downtown areas, or 
during solar flares, the GPS service may be unavailable. 

For more accurate synchronization in sensor networks and net- 
worked control a variety of algorithms have been suggested. Ref- 
erence Broadcast Synchronization (RBS) 1271 is a receiver-receiver 
synchronization algorithm, which exploits the broadcast nature of 
the wireless medium. Nodes broadcast packets without any time- 
stamping on the transmitter side. The nodes that receive the transmit- 
ted packets record the reception times and exchange them with their 
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neighbors, so as to estimate their clock difference, by assuming that 
the one-way delays are the same for neighboring nodes. This com- 
pletely eliminates the transmitter-side non-determinism, and accuracy 
depends mainly on the difference of receiving delays. This scheme 
was tested in actual sensor networks comprising of Berkeley motes 
and achieved precision within 11 [is [27 1 . 

It was established in [IV] that all nodal skews are still deter- 
minable in the receiver-receiver synchronization scenario under some 
assumptions on the graph topology. However, the uncertainty set is a 
translation of (2n— 1)— dimensional subspace and neither round-trip 
delays are determinable nor causality can be exploited (cf. Theorem 
10, El). 

Timing-Sync Protocol for Sensor Networks (TPSN |29|) is a 
sender-receiver synchronization protocol which uses time-stamping at 
the MAC layer to eliminate the transmission delay which is typically 
the most variable term in wireless sensor networks. In 1 29 1, it was ob- 
served and verified by simulations that TPSN achieves approximately 
twice the accuracy of RBS for pairwise synchronization. In [28 1, 
authors identify further sources that contribute to packet delivery de- 
lay and propose the Flooding Time Synchronization Protocol (FTSP 
1 28 1). FTSP uses hardware solutions and efficient time-stamping to 
eliminate all packet delay factors but the propagation delay, and 
linear regression to compensate for clock drifts; the precision that 
it achieved was measured in the order of 10 /is for a network with 
several hundreds of nodes. 

Network-wide offset estimation: Given estimates of relative 
offsets for all links, i.e., between pairs of neighboring nodes, the 
network-level goal of synchronization is to obtain an estimate of all 
nodal offsets with respect to the reference node [22], |23|, [24], [25 1. 

Given relative offset measurements with some known variance, 
the minimum variance unbiased linear estimate of the pairwise offset 
differences between any two nodes has been derived in 1221 . It 
is shown that its variance is equal to the effective resistance between 
those two nodes, in an electric network where each link's resistance 
equals the variance of the relative offset measurement. An iterative 
algorithm to obtain the minimum-variance nodal offset estimates is 
also derived. 

A decentralized asynchronous algorithm based on spatial smooth- 
ing of pairwise estimates has been developed and implemented in 
1 23 1, and comparative evaluations have been performed. The accuracy 
achieved was 2fis for pairwise synchronization, and 20/j.s for a 
network of 40 nodes, in which FTSP achieved an average accuracy 
of 30/is. In [24) the convergence of a synchronous version of 
this method was studied, and error asymptotics for various graph 
topologies were derived. A similar scheme which uses the Jacobi 
algorithm to obtain an iterative solution to a set of linear equations 
has been derived in 1251 . 

We summarize the spatial smoothing algorithm of |23|, |24|. In a 
network of n + 1 nodes, where node is the reference node, denote 
the directed graph of links across which state estimates are available 
at some given time by G — (V, £). Also denote the reduced incidence 
matrix of the graph obtained from the incidence matrix by removing 
the row corresponding to node 0, by A , and assume that the graph 
is connected. Then AA T is the principal submatrix of the Laplacian 
of the graph, which is known to be positive definite for connected 
graphs, and hence invertible. Denote the relative offset on link 
by Tij := tj — n, where Ti,Tj represent the local times of clocks 
i,j, respectively. We have 

o = A T v, (7) 

where o = {t^} G R' £ ' is the vector of relative offset estimates, 
A G K' £ ' xn , v = {ri}™ G R". We can formulate the network- wide 
estimation problem as a least squares problem with error criterion 

F(v) := \\d- A T v\\ 2 . (8) 

The solution is v = (AA T )~ 1 Ad. If we further assume that each 
link estimate is associated with a variance, then the nodal estimate 
Vi has variance equal to the effective resistance between node i and 



node 0, in an electric network where each undirected link has a 
resistance equal to the variance of its measurement. An application of 
the latter result, combined with circuit theory, allows us to determine 
the maximum error variance asymptotics for different graphs [24|. 
In a tree network, the error variance is proportional to the diameter, 
while in a complete graph it decreases like 0(l/n). In a lattice, the 
asymptotic growth rate is O(logn). What is very interesting, and 
provides theoretical support for the feasibility of clock synchroniza- 
tion in planar wireless networks, is that the synchronization error can 
be kept bounded even in networks of large size 1 24 1 

Theorem 7. Consider n nodes located randomly (uniformly and 
independently) on a planar disk. Suppose that all nodes choose a 
common range that ensures connectivity (see section [77J. Then, the 
error variance is bounded as the number of nodes n —* oo. 

To avoid centralized computation of the pseudo-inverse operator 
(j4A t ) _1 A, a distributed iterative implementation was derived in 
1 23 1, [24] as follows: Setting the i — th derivative of F(v) equal to 
0, yields (AA T )iV — Aid = 0, and a straightforward analysis |24| 
shows that coordinate descent gives rise to an asynchronous scheme 
where node i updates its estimate Vi according to 

Vi = j (9) 

1 j:(i,j)££ Or (j,i)£S 

where di, denotes the total degree of node i. Note that the scheme 
is fully distributed and has the further advantage that it uses no 
information about the network topology. 

Above we have studied the asymptotic accuracy achievable in 
clock synchronization over various graphs. Next, we turn to the 
rate of convergence. In order to study convergence time of J51, 1241 
considered a synchronous implementation of the form 

u fc+ i = (I~D- 1 AA T )v k , (10) 

where D is the diagonal matrix of nodal degrees, and Vk := Vk — 
(AA T )~ 1 Ad. The convergence rate of the synchronous scheme is 
completely characterized by the spectral radius p of M := (I — 
D' 1 AA T ). Bounds on p(M) were derived in | 24| by an application 
of Cheeger's inequality 1371 : 

Theorem 8. The spectral radius p(M) satisfies: 

1 - 2=^- < p(M) < 1 - (=^— ) 2 , (11) 

where k is the edge-connectivity of the graph. In particular, the 
settling time to an e— neighborhood of the final value, for both lattice 
and random planar graphs, is 0(n 2 ). 

V. A Theory of In-Network Computation 

Sensor networks should be distinguished from general wireless 
ad-hoc networks. Traditional data networks are only concerned with 
end-to-end information transfer, but sensor networks may only be 
interested in gathering certain aggregate functions of distributed data. 
For example, one might want to compute the average temperature 
for environmental monitoring, or the maximum temperature in fire 
alarm systems. In this context, communicating all the relevant data to 
a central collector node, which subsequently computes the function, 
might be inefficient, since it requires excessive data transfer or energy. 
Sensor nodes are severely limited in terms of power and bandwidth 
and it becomes necessary to find optimal in-network aggregation and 
communication strategies for efficient function computation. Such in- 
network processing and aggregation is made possible by the fact that 
sensor networks are often application specific, that is, deployed to 
achieve a specific goal, and hence nodes may look into the contents 
of packets and create new packets or discard others. This is in sharp 
contrast to data networks where nodes only process the headers of 
the packets, but never the contents of the payload of the packets. 

A fundamental challenge therefore is to exploit the structure of 
the particular function of the data that is of interest, so as to 
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optimally combine transmissions at intermediate nodes. Thus the 
problem of function computation is more complex than finding 
the capacity of a wireless network, since the traditional decode 
and forward model does not capture the possibility of combining 
information at intermediate nodes. The general problem of computing 
a function of correlated data, over a distributed network of nodes 
with wireless links, admits a variety of approaches. It is related to 
fundamental problems in multi-terminal information theory [47| [48 1, 
distributed source coding, communication complexity and distributed 
computation. In this section, we highlight the major approaches, and 
the hierarchy of ideas within. 

A. Block computation of functions in multi-party networks 

Consider a general network of n nodes, with a designated collector 
node to which the value of the aggregate function of interest needs to 
be communicated. Each node has a certain transmission range, and 
can transmit directly to any other node within that range, provided 
no other transmission is interfering. Two networks of interest are the 
collocated network, where the connectivity graph is complete; and the 
random multihop network where the connectivity graph is a random 
geometric graph (see Section [Hp. This problem was studied in [39 1, 
to determine the limits of what can be achieved even after allowing 
for block computation. Nodes make a block of N measurements, 
and the collector or fusion node seeks to compute the block of 
function values. Communicating a block of measurements has been 
shown by Shannon [40 1 to be critical to achieving reliability with 
high throughput. Analogously, the measure of efficiency in sensor 
networks is the computational throughput, which is defined to be the 
minimum number of time units required per computation, over all 
schemes, and over all block lengths. 

In sensor networks, we are often interested in symmetric functions 
which only depend on the data of a sensor, not its identity. These 
are defined as functions whose values remain unchanged even if 
the values of its arguments are permuted. They include statistical 
functions like mean, median, maximum/minimum, and others which 
are completely determined by the histogram of the set of node 
measurements. A key property is that the histogram of two disjoint 
sets can be combined to give the histogram of the union. This suggests 
a divide-and-conquer algorithm where a spanning tree is constructed, 
and partial histograms are propagated from children to parents up the 
tree towards the root where the collector is located. Further, one can 
achieve optimal spatial reuse of the wireless medium, by choosing 
the transmission range appropriately, and pipelining the transmissions 
(|39), Thm. 1). 

Theorem 9. For a connected network with n nodes and maximum 
degree O(logn), the computational throughput is 6( lo n )< l n par- 
ticular, for the random network on a unit area square, if the common 

transmission range is chosen to be 0(y ^^), then one can achieve 

a computational throughput Q ( lo * - ) with high probability. 

The lower bound Q( lo 1 n ) follows from the fact that representing 
the histogram itself requires 0(logn) bits, and the collector node 
can receive only a bounded number of bits per slot. An important 
consequence of this result is that the natural method of tree aggrega- 
tion is order-optimal for symmetric function computation in random 
networks. 

For block computation of a general function of correlated measure- 
ments, one needs to find a quantity analogous to the histogram, which 
can be composed across disjoint subsets of nodes. Towards this end, 
one can abstract out the medium access control problem associated 
with a wireless network, and view the network as a directed graph 
with edges representing essentially noiseless wired links between 
nodes. Thereby, we can focus on strategies for combining information 
at intermediate nodes, and optimal codes for transmissions on each 
edge. 

We begin by considering the simple two node problem. Suppose 
nodes vx and vy have measurements x G X and y G y, where 



the alphabets X and y are finite sets. Node vx needs to optimally 
communicate its information to node vy so that a function f(x,y), 
which takes values in T>, can be computed at node vy with zero error. 
One can consider both worst-case and, if a probability distribution 
over inputs (x, y) G X x y is specified, average case performance 
metrics. It is easy to see that any feasible encoder at vx must 
necessarily separate x\ and X2 if there exists y* G y such that 
f(xi,y*) 7^ f(x2,y*)- In fact, one can show |41| that the optimal 
encoder is one which greedily combines inputs in X that need not 
be disambiguated to obtain a reduced alphabet, and then assigns a 
distinct codeword to each input in the reduced alphabet. This specifies 
the optimal encoder in the worst-case scenario. 

In the average case scenario, any feasible encoder at vx must 
separate x\ and X2, if there exists y* G y such that f(xi,y*) f^ 
f(x2, y*) and p(x lt y*)p(x 2 ,y*) > 0. In the case where p(x, y) > 
for all x and y, the optimal encoder is again obtained by greedily 
combining inputs that need not be disambiguated and then applying 
the Huffman code on the reduced alphabet. If on the other hand, the 
probability distribution does have 0s, finding the optimal encoder is 
NP-Hard, but the asymptotic per-instance complexity is a quantity 
called the graph entropy Hq(X), as shown in 1421 . 

Interestingly, this idea can be extended to obtain optimal in- 
network aggregation strategies on directed tree graphs [41 1. Since 
each edge is a cut-edge, one can find the disambiguation requirements 
for each edge, assuming all the nodes at either end of the edge 
are collaborating. Further, one can define encoders at each node, 
recursively and in a way that is consistent, and show that the above 
necessary condition is indeed sufficient. 

Consider a directed tree graph with the collector as the root node. 
Let Ti denote the subtree with node i as root. The encoder at an 
intermediate node i greedily combines input combinations of nodes 
in Ti that need not be disambiguated, to obtain a reduced alphabet. In 
the worst-case scenario, the encoder at node i assigns a fixed length 
codeword to each element in the reduced alphabet. In the average- 
case scenario, the encoder at node i constructs a Huffman code on 
the reduced alphabet. The decoder at an intermediate node i assigns 
a nominal input value to the nodes in its subtree T, based on the 
received transmissions 1411 . 

Theorem 10. For a directed tree graph, the above coding strategy 
achieves zero-error function computation, and simultaneously mini- 
mizes the number of bits transmitted on each edge, both in the worst- 
case scenario, and in the average case scenario when p(x, y) > 0. 

The extension to directed acyclic graphs presents some challenges. 
A key difference from the tree case is the presence of multiple paths 
to route the data, which present different opportunities to combine 
information at intermediate nodes. One can derive an outer bound to 
the rate region by finding the disambiguation requirements for each 
cut of the directed graph. This outer bound is not necessarily tight. A 
natural achievable strategy is to activate a subset of edges constituting 
a tree and then apply the optimal strategy for tree aggregation. The 
tree rate points are extreme points of the rate region. By time-sharing 
between different trees, one can achieve any rate point in the convex 
hull of the tree rate points. However, this does not match the outer 
bound for even simple examples . 

Counter Example 1 (Arithme tic Su m). Consider three nodes v\, 
V2, 1)3 connected as in Figure 2(a) Let Xi = X3 = {0, 1}, with 
node Vi having no measurements. Suppose node V\ wants to compute 
f(Xi,X 2 , A3) = X-2 + A 3 . Let (R21, R31, R32) be the rate vector 
associated with edges (Zi, I2, Z3). The outer bound on 1Z WC is: 

R21 > 1; R21 + R31 > log 3; R32 + R31 > 1. 
The subset of the rate region achievable by trees is: 
R21 = A + (l- A)log3,i? 3 i = A, #32 = (1 — A) for < A < 1. 

Suppose that X\, X2 are i.i.d. with p(Xi = 0) = p(X\ = 1) = 
0.5. The outer bound on TL avg is: 



R21 > 1; fl 2 i+-R 3 i>§; i?32 + i? 3 i>l. 
The subset of the rate region achievable by trees is: 
R21 = A + (1- A)f,i? 3 i = A,i? 3 2 = (l-A)/or0< A < 1. 
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(a) Counterexample 1 (b) Parity in DAGs 

Fig. 2. Function computation in DAGs 



In some special cases, the cut-set outer bound can be achieved by 
aggregation along trees 1411: 

Theorem 11. For the worst case computation of finite field parity 
in To, and the maximum or minimum functions, the cut-set outer 
bound to the rate region is indeed tight. 

The idea is that every leaf node splits its block and transmits 
the segments on the outgoing edges from Vi (See Figure [2(b)) . Each 
intermediate node receives partial blocks from lower nodes, and 
can hence compute an intermediate parity/intermediate maximum for 
some instances of the block. It then splits this intermediate processed 
block along the various outgoing edges. Proceeding recursively up the 
DAG, we can achieve the cut-set outer bound. 

The coding strategies described above can be viewed as a so- 
phisticated form of network coding, which achieves efficient block 
computation. The multicast problem, where a source node wants to 
send the same message to multiple destinations, was studied in [43 1. 
For this problem, linear network coding is known to achieve the cut 
capacity 1441 . The function computation problem is a somewhat dual 
problem, where the collector wants to calculate a function of sensor 
measurements, and we seek optimal encoders. 

B. Classification of symmetric functions 

Among the class of symmetric functions, not all functions are 
equally hard to compute. For a function like Maximum, if a node 
knows that the maximum temperature recorded until now is at least 
100, then the node need not transmit any further reading unless it 
has a higher value. Thus the previous transmissions provide side 
information about the function value, even if the measurements are 
independent. On the other hand, for a function like Average, if even 
a single measurement is missing, we could have incorrect function 
computation, if the goal is zero-error block function computation. 
The results do change significantly if we allow a vanishing error as 
block length increases. 

In 1391 . the collocated network scenario is analyzed, where all 
transmissions can be heard by all nodes, and collisions do not convey 
information. Thus, one restricts attention to collision-free strategies 
where nodes can only transmit one at a time, with each node's 
transmission depending only on its data and previous transmissions all 
of which it has heard. Each node i has a measurement Xi taking values 
in a finite set X. Define x := (xi, X2, . . . , x n ), and let r(x) denote 
the type vector of length \X\. A symmetric function f(x) depends 
on x only through its type vector r(x). Two interesting classes of 
symmetric functions can be identified, namely type-sensitive and type- 
threshold functions. 

Definition 1 (Type-sensitive function). A symmetric function f(x) 
is said to be type-sensitive if there exists some 7 with < 7 < 1, 
and an integer n, such that for n > n, and any j < n — \"/n] , 
given any subset {x±, X2, . ■ ■ Xj}, there are two subsets of values 
{yj+i,y j+ 2, . . . j/n} and {z j+1 ,z j+ 2, . . . z„} such that 

f(x 1 ,...x j ,y j+1 ,...y n ) / f(x 1 ,...x j ,z ]+1 ,...z n ). 



For a type-sensitive function, a certain minimum fraction of 
the arguments need to be known for the function value to be 
determined. Instances of type-sensitive functions include Average, 
Median, Majority and Histogram. Counting arguments can be used 
to show that the maximum rate for computing type-sensitive functions 
in the collocated network is O(^). A trivial achievable strategy of 
rate fi(^) involves each node declaring its value. This implies that 
the class of type-sensitive functions is orderwise maximally hard to 
compute. 

Definition 2 (Type-threshold function). A symmetric function f(x) is 
said to be type-threshold if there exists a non-negative threshold vec- 
tor 6 of length \X\, such that f(x) = f'(z(x)) = f'(mm(T(x),9)), 
for all x £ X n , with rain signifying element-wise minimum. 

Instances of type-threshold functions include Maximum, Minimum 
and k th largest value. In the case of binary measurements, for 
instance, the value of a type-threshold function is determined if the 
number of Os/ls exceeds a fixed threshold. Loosely speaking, the 
value of a type-threshold function is entirely determined by certain 
outstanding measurements. For this class of functions, an exponential 
speedup is possible, and combinatorial arguments show that the 
computational throughput of this scheme is Q(j^rn)- 

The upper bound on the computational throughput in each case is 
obtained by generalizing the concept of fooling sets in communication 
complexity. Using this technique provides an upper bound on the 
throughput of all interactive protocols. We will revisit this idea in 
section [V-C] Having derived an order-optimal strategy for collocated 
subnets, one can proceed to quantify the computational throughput of 
random planar networks. Spatial reuse of the wireless medium leads 
to a exponential speedup, and we have the following theorem 1391 . 

Theorem 12. The computational throughput of computing a type- 
sensitive function in a random planar network is 6( lo ^ n ); for a 
type-threshold function, it is Q( , — r )• 

■ 1 J v log log n ' 

If we assume that the measurements are drawn independently and 
identically from some distribution, one can obtain an even higher 
computational throughput I 4T1 : 

Theorem 13. Suppose that the measurements Xi = 
(Xi t i,Xi t 2,...,Xi,N) are i.i.d. with p(X it i = 1) = p. Let 
f(X\, X2, ■ ■ ■ , X n ) be a symmetric type threshold function with 
threshold vector [0, 0]. Thus the value of the function depends on the 
number of Is, upto a threshold 9. The average case computational 
throughput for zero error block computation of the function f is 
6(1) bits. 

C. Computing Symmetric Boolean functions 

For certain special classes of functions, one can go further and seek 
exactly optimal strategies for computing functions in a collocated 
network. For the case of symmetric Boolean functions, this problem 
was studied in [45). 

Consider a collocated network with nodes 1 through n, where 
each node wants to compute the function f(X\, X2, ■ ■ ■ , X n ) of 
the measurements. We seek to find communication schemes which 
achieve correct function computation at each node, with minimum 
worst-case total number of bits exchanged. Each node i has a block of 
N independent measurements, and we restrict ourselves to collision- 
free strategies. Let Sn be the class of collision-free strategies for 
block length N which achieve zero-error block computation, and 
let C(/,Sn,N) be the worst-case total number of bits exchanged 
under strategy Sn G Sn- The worst-case per-instance complexity of 
computing a function f(Xi, X2, . . . , X n ) is defined by 



C(f) 



lim min 

at^oc s N es N 



C(f,S N ,N) 
N 



We call this the broadcast computation complexity of the function /. 

Before we can address the general problem of computing symmet- 
ric Boolean functions, we consider the specific problem of computing 
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the AND function, which is 1 if all its arguments are 1, and 
otherwise. The basic two node problem was studied in 1431 : 

Theorem 14. Given any strategy Sn for block computation of Xi A 

X 2 , 

C(Xi AX 2 ,S N ,N) > jVlog 2 3. 

Further, there exists a strategy Sjy which satisfies 

C(X 1 AX 2 ,S* N ,N) < [iVlog 2 3l. 

Thus, the complexity of computing X\AX 2 is given by C(X\ AX 2 ) = 
log 2 3. 

The lower bound is shown by constructing a fooling set |46| of 
the appropriate size. Consider two nodes X and Y, each of which 
take values in finite sets X and y, and both nodes want to compute 
some function f(X, Y) with zero error. 

Definition 3 (Fooling Set). A set E C X x y is said to be a fooling 
set, if for any two distinct elements (x\,y\), (x 2 ,y 2 ) in E, we have 
either 

• f{xx,V\) + f(x 2 ,y 2 ), or 

• f(%l,yi) = f(x 2 ,y 2 ), but either f(xi,y 2 ) / f(xi,Vi) or 
f(x 2 ,yi) / f(xi,yi). 

Given a fooling set E for a function f{X\,X 2 ), we have 
C(f(Xi,X 2 )) > log 2 \E\, The extension to multi-dimensional 
fooling sets is straightforward and gives a lower bound on the 
communication complexity of the function f(Xi, X 2 , . . . , X n ). 

The above approach can be extended to the general AND function 
of n variables, to obtain C(A(Xi, X 2 , . . . , X n )) = log 2 (n + 1). 
Further, one can derive the broadcast communication complexity of 
a more general class of functions, called threshold functions, which 
includes AND as a special case |45|. 



Definition 4 (Boolean threshold functions). A Boolean threshold 
function He(Xi, X 2 , . . . , X n ) is defined as 



U e (Xi,X 2 



,X n ) 



1 <7E,*> 

otherwise. 



Theorem 15. The complexity of computing a Boolean threshold 
function is C(TLe{Xi,X 2 , . . . X n )) = log 2 ( U \ 

We now turn to Boolean interval functions, for which we only 
know the approximate complexity. 

Definition 5 (Boolean interval function). A Boolean interval function 
ri[a,6] (X\, . . . , X n ) is defined as: 



rl[a,6] (Xl,X 2 , • • ■ , X n ) 



1 ifa<J2 t X,<b 
otherwise. 



A naive strategy to compute the function IIuMpfi, . . . , X n ) 
is to compute the threshold functions IL a (Xi, . . . , X n ) and 
Hb+i (Xi, X 2 , . . . , X n ). However, this strategy gives us more in- 
formation than we seek, i.e., if J]\ Xi £ [a, b] c , then we also know 
if yV Xi < a, which is superfluous information and perhaps costly 
to obtain. Alternately, we can derive a strategy which explicitly deals 
with intervals, as against thresholds. This strategy has significantly 
lower complexity |45|. 

Theorem 16. The complexity of computing a Boolean interval 

function TL\ a ,b] (Xi, X 2 , , X n ) with a + b < n is bounded as 

follows: 



log 2 



n + 1 
6 + 1 



+ 



n 

a - 1 



< c(n[ ai t](Xi, x 2 , . . . x„)) 



n+l 
6+1 



+ (6-a + l) 



n 

a- 1 



(12) 



The complexity of computing a Boolean interval function 
n[ a ,6](^i, • ■ • ,X„) with a + b > n is bounded as follows: 



log 2 



n + l 



< log 2 



+ 



n 

6+1 



< c(ri[ ai6 ] (x 1 ,x 2 , . . . x„)) 

(13) 



' * + (6 - a + 1) 1 " 



6+1 



D. Information theoretic formulation 

To study the ultimate performance limits of function computation 
and optimal function computation strategies, one again needs to turn 
to an information theoretic formulation. There are two features that 
can be incorporated in this most general formulation. First, we can 
allow for a vanishing error of computation, in contrast with the 
previous formulations which consider zero-error block computation. 
Second, we can exploit the correlation in sensor measurements, 
and achieve higher efficiency. However, there are very few results 
for this most general framework. We now review some of the 
basic information theoretic results, which lead to a more general 
formulation that incorporates computation over wireless networks. 

Consider two sensors with measurements X and Y drawn accord- 
ing to the joint distribution p(X, Y). The sensors take measurements 
in each time slot that are jointly correlated, but temporally inde- 
pendent and identically distributed. The two sensors are connected 
through noiseless independent links to a receiver, to which the 
measurements need to be communicated. The rates of the two links 
are Ri and R 2 respectively. We wish to determine the rate region, 
i.e., the set of all possible pairs of rates (Ri,R 2 ) at which the 
sources can be individually compressed and sent to the receiver, such 
that the receiver can reconstruct the original sources with vanishing 
probability of error. This problem of decentralized compression of 
correlated sources is known as the Slepian-Wolf problem 147 1 . 

The key challenge in this problem is to exploit the correlation in 
the measurements, even though the two sensors perform distributed 
compression. If both sensors had access to each others measurements, 
they would together still need to communicate at least H(X, Y) bits, 
i.e, Ri + R 2 > H(X,Y). Further, if X supposes that the receiver 
has full knowledge of Y, it still needs to transmit H(X\Y) bits, 
i.e., R 1 > H(X\Y). Applying a similar argument for Y, we have 
R 2 > H(Y\X). The remarkable result of Slepian and Wolf is that 
this region is indeed achievable. The achievability strategy uses the 
technique of random binning to exploit correlation. The Slepian-Wolf 
problem can be easily extended to the case of multiple correlated 
sources communicating to a receiver. However, the extension to tree 
networks, with the receiver as root, has not been solved. 

Another interesting variation of the problem arises if we only 
desire the reconstruction of sources to some fidelity, i.e., the receiver 
wishes to recover estimates X, Y such that E[D(X, X')] < d, 
E[D(Y, Y')] < d, where D(-,-) is a given distortion measure. This 
problem is open as well. The special case of this problem in which 
one of the sources is known to the receiver as side information, and 
only the other is to be determined, was solved by Wyner and Ziv [48 1. 
It is important to note that function computation can be viewed as a 
special case of the rate-distortion problem, by defining an appropriate 
distortion metric that is function-dependent. For example, 

D(X,X) := \ f(X,Y)- f(X,Y)\ 2 . 

This has been extended in |49| to the case in which the receiver 
desires to know a certain function f(X, Y) of the single source X 
and the side information Y, and determined the required capacity 
of the channel between the source and receiver as being a function 
of the conditional graph entropy, which is a measure defined on the 
two random variables and the characteristic graph | 50| defined by the 
function /(■). Recently, there have been some extensions to the case 
of two nodes 1511 and to the case of tree networks [52|. However, a 
general single-letter characterization has remained elusive. 

Another interesting stand-alone problem was studied in [53 J, 
where two correlated binary sources need to encode their sequences 
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in distributed fashion, so that the receiver can compute the XOR 
function of the two sequences. It is shown that, in some cases, the 
sum rate required may be substantially less than the joint entropy of 
the two sources. This clearly displays the advantage of function-aware 
encoding strategies over standard distributed source coding. 

The above formulation considers a single round protocol, where 
only X communicates with Y. One can further seek the information 
theoretic limits for interactive computation. The rate region for 
multi-round interactive function computation has been characterized 
for two nodes in 1541 . and for collocated networks in [551 - The 
characterization closely resembles the Wyner-Ziv result and some 
interesting connections with communication complexity are made. 

E. Modeling channel noise 

All the above models assume that the channel is a noiseless wired 
link, possibly after the medium access control problem is solved. 
However, in practice, all channels are noisy, and one needs to study 
the rate of computation over noisy channels. In 1561 , the problem 
of computing parity in a broadcast network was studied, assuming 
independent binary symmetric channels between each pair of nodes. 
In the noiseless case, one would require n bits, and in the noisy 
case, by using repetition coding, we require ®(n\ogn) bits for low 
probability of error. However, this does not make use of the broadcast 
nature of wireless medium. For each bit that is transmitted, all the 
other nodes in the network hear a noisy version of it. The key 
contribution of 1 56 1 is a strategy which systematically uses this side- 
information to compute the parity function. The hierarchical strategy 
proposed requires only O(nloglogn) bits. Remarkably, this upper 
bound of n log log n was shown to be sharp in 1571 . using noisy 
binary decision trees. 

In 1 58 1, the problem of distributed symmetric function computation 
in noisy wireless networks is considered. It is shown that the energy 
usage for computing a symmetric function in a network with binary 

symmetric channels is ^n(loglogn) fy^jpM where a is 

the path loss exponent. 

While the information theoretic formulation described above only 
considers distributed source coding, one would like to generalize 
the formulation to incorporate a more general channel model. For 
example, consider two sources SI and S2, which have access to 
channel inputs X and Y of a multiple access channel, with output 
Z = X + Y + N being available to a receiver, where TV is Gaussian 
noise. The receiver wishes to compute the sum X + Y . The question 
of interest is to find the optimal power-distortion curve, i.e., for a 
given pair of transmit powers PI and P2, what is the minimum 
distortion D at which the sum X + Y can be communicated to 
the receiver. Thus, the channel operation itself can be viewed as an 
implicit computation as argued in 1591 . 

Finally, the above described solution to the medium access problem 
can be described broadly as interference avoidance, since we assume 
that collisions do not convey information. We could generalize the 
formulation further by assuming an interference network. However, 
the solution to such a problem is very far from the current frontiers 
of knowledge in information theory. 

VI. Conclusions 

We have presented some foundational results for sensor networks 
related to connectivity, capacity, clocks and computation. The fun- 
damental approach studied is that of focusing on large networks 
and obtaining understanding through asymptotics. We have provided 
necessary and sufficient conditions that guarantee asymptotic connec- 
tivity in a wireless sensor network. We have presented the asymptotics 
on the capacity of wireless networks derived from both geometric and 
network information theoretic models and briefly discussed protocol 
implications. Next, we presented fundamental impossibility results 
on synchronizing the simplest class of affine clocks in networks, 
under idealistic assumptions, while fully characterizing the uncer- 
tainty set. We have presented a decentralized scheme for smoothing 



estimates of pairwise clock offsets. Finally, we have have addressed 
the broad problem of in-network function computation. We have 
presented optimal strategies for general functions on tree graphs, and 
studied tree aggregation in general graphs. We have also presented 
a classification of functions based on complexity of computation, 
and the exact complexity of computing certain Boolean functions in 
collocated networks. These results provide a basis for designing and 
analyzing large sensor networks. 
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